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Abstract 

We show that the satisfiability and finite satisfiability problems for the 
two-variable fragment of first-order logic with counting quantifiers are 
both in NEXPTIME, even when counting quantifiers are coded succinctly. 



1 Background 

The two- variable fragment with counting quantifiers, here denoted C 2 , is the set 
of function-free, first-order formulas containing at most two variables, but with 
the counting quantifiers 3<c, ^>c an d 3=c (for every C > 0) allowed. The 
satisfiability problem, Sat-C 2 , is the problem of deciding, for a given sentence <j> 
of C 2 , whether </> has a model; the finite satisfiability problem, Fin-Sat-C 2 , is the 
problem of deciding, for a given sentence <p of C 2 , whether <fi has a finite model. 
It is well-known that C 2 lacks the finite model property; hence Sat-C 2 and Fin- 
Sat-C 2 do not coincide. The decidability of Sat-C 2 was shown independently 
by Gradel, Otto and Rosen [2], and by Szwast and Tendera For a more 

general survey and some recent extensions, see Gradel and Otto £Q and Otto [j]]. 

When discussing the computational complexity of these problems, it is im- 
portant to specify how the sizes of numerical quantifier subscripts are measured. 
Under unary coding, a counting subscript C is taken to have size C; under binary 
coding, by contrast, the same subscript is taken to have size logC. In deter- 
mining upper complexity-bounds, unary coding is evidently the less stringent 
accounting method, because the size of formulas involving counting quantifiers 
is, in general, exponentially larger than under binary coding. Pacholski et at, 
op. cit. showed that Sat-C 2 is in NEXPTIME, but only under the assumption 
of unary coding. The present paper shows that both Fin-Sat-C 2 and Sat-C 2 are 
in NEXPTIME, even with binary coding. We remark that both Fin-Sat-C 2 and 
Sat-C 2 are easily shown to be NEXPTIME-hard on either coding scheme. 

In the sequel, we confine attention to finite or countably infinite structures 
interpreting finite signatures of unary and binary predicates, without individual 
constants or function-symbols; in addition, we treat the equality predicate « as a 
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logical constant. The lack of individual constants in the signature constitutes no 
essential restriction of expressive power, because their effect can be reproduced 
using formulas of the form 3 = ixp(x). The presence of equality in the logic 
constitutes no extension of expressive power, because it can be defined by the 
formula \lx(x «i)A \/xB = iy(x « y). 

2 Preliminaries 

If is a formula, let ||</>|| denote the number of symbols in 0, assuming that a 
numerical quantifier subscript C involves log C symbols. Thus, if the quantifier 
3=c occurs in <f>, we have C < 2^1. The following type of normal-form lemma 
is standard in treatments of C 2 . (Recall that, in this paper, we are confining 
attention to signatures with no individual constants or function-symbols.) 

Lemma 1. Let <fi be a sentence in C 2 . We can construct, in time bounded by a 
polynomial function of \\(j>\\, a formula 

<j>* := Mxa A \/x\/y(/3 V x « y) A f\ \/x3 =Ch y{h{x, y) A x 56 y) 

l<h<m 

satisfying the following conditions: (1) a is a quantifier- free, equality-free for- 
mula with x as its only variable, (ii) (3 is a quantifier- free, equality-free formula 
with x and y as its only variables, (Hi) the fh (1 < h < m) are binary predicates, 
(iv) (f>* is satisfiable if and only if <fi is satisfiable over a domain of size greater 
than C = maxi< fe < m C h . 

Proof. Routine adaptation of textbook transformation to Scott normal form. 
See, for example, Gradel and Otto pQ Sec. 2.1 for an explanation of the required 
techniques. □ 

Wc remark that, since C is bounded by a singly exponential function of ||0||, 
satisfiability of 4> over domains of size C or less can certainly be determined in 
NEXPTIME. 

We next review some standard concepts. Fix a finite signature E of unary 
and binary predicates (no individual constants or function-symbols). A literal is 
an atomic formula or the negation of an atomic formula. A 1-type is a maximal 
consistent set of (equality-free) literals involving only the variable x. A 2-type 
is a maximal consistent set of equality-free literals involving only the variables 
x and y. If r is a 2-type, then the result of transposing the variables x and y in 
t will also be a 2-type, denoted r . If 21 is any structure interpreting E, and 
a G A, then there exists a unique 1-type tt(x) such that 21 (= 7r[a]; we denote it 
by tp a [a]. If, in addition, b e A is distinct from a, then there exists a unique 
2-type t(x, y) such that 21 |= r[a, b); we denote r by tp a [a, b]. We do not define 
tp a [a, b] if a = b. If n is a 1-type, we say that it is realized in 21 if there exists 
a G A with tp a [a] = n. If r is a 2-type, we say that r is realized in 21 if there 
exist distinct a, b G A with tp a [a, b] = t. 



2 PRELIMINARIES 



3 



Notation 1. Any 2-type t includes a unique 1-type, denoted tp 1 (r); in addition 
we write tp 2 (T) for tp 1 (r _1 ). 

Remark 1. If tp a [a, b] = t, then tp a [6, a] = r _1 , tp a [a] = t Pl (r) and tp a [6] = 
tP2( r )- 

We next introduce some non-standard concepts. 

Definition 1. A counting signature £ is a signature of unary and binary predi- 
cates (no individual constants or function-symbols) with a distinguished subset 
of binary predicates; we refer to these distinguished binary predicates as the 
counting predicates of E. 

Counting signatures help us analyse models of formulas in C 2 by allowing 
us to treat a subset of the binary predicates in a special way. Since counting 
signatures are signatures, we may meaningfully speak of 1-types and 2- types 
over counting signatures. 

Definition 2. Let E be a counting signature, and let r be a 2-type over E. We 
say that r is a message-type (over E) if, for some counting predicate / of E, 
f(x, y) St. If r is a message-type such that r _1 is also a message-type, we say 
that t is invertible. On the other hand, if r is a 2-type such that neither r nor 
t -1 is a message- type, we say that r is silent. 

Thus, a 2-type r is an invertible message- type if and only if there are counting 
predicates / and g such that f(x,y) £ r and g(y,x) E r. The terminology is 
meant to suggest the following imagery. If tp a [a, b] is a message-type //, then 
we may imagine that a sends a message (of type n) to b. (On this view, a can 
send at most one message to 6). And if (i is invertible, then b replies by sending 
a message (of type back to a. If tp a [a, b] is silent, then neither element 

sends a message to the other. 

The remainder of this section is devoted to three simple observations on 
structures interpreting counting signatures. 

Definition 3. Let 21 be a structure interpreting a counting signature E, and let 
Y be an integer. We say that 21 is Y -branching {over E) if, for every a S A and 
every counting predicate / of E, the set {a 1 e A\a ^ a' and 21 |= /[a, a']} has 
cardinality at most Y. We say that 21 is finitely branching if 21 is Y -branching 
for some FeN. 

Definition 4. Let 21 be a structure interpreting a counting signature E. We say 
that 21 is chromatic (over E) if, for all a, a', a" e A, the following two conditions 
hold: 

1. if a a' and tp a [a, a'] is an invertible message- type, then tp a [a] ^ tp a [a']; 

2. if a, a', a" are pairwise distinct and both tp a [a,a'] and tp a [a',a"] are in- 
vertible message- types, then tp a [a] ^ tp a [a"]. 
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Thus, a chromatic structure is one in which distinct elements connected by 
a chain of 1 or 2 invertible message-types have distinct 1-types. 

Lemma 2. Let E be a counting signature with exactly m counting predicates, let 
Y be a non-negative integer, and let 21 be a Y -branching structure interpreting E. 
Let E' be the signature formed by adding log((my) 2 + l) (rounded up) new unary 
predicates to E ; and regard E' as a counting signature by taking the counting 
predicates of E' to be the same as the counting predicates of E . Then 21 can be 
expanded to a chromatic structure 21' interpreting E'. 

Proof. Consider the (undirected) graph G on A whose edges are the pairs of 
distinct elements connected by a chain of 1 or 2 invertible message-types. That 
is, G = (A, E 1 U E 2 ). where 

E 1 ={(a,a') | a ^ a' and tp a [a,a'] is an invertible message-type} 

E 2 ={(a, a") \ a ^ a" and for some a' € A, (a, a 1 ) and (a 1 , a") are both in E 1 }. 

Since there are only m counting predicates in E and 21 is ^-branching, the 
degree of G (in the normal graph-theoretic sense) is at most (mY) 2 . Now 
use the standard greedy algorithm to colour G with (mY) 2 + 1 colours. By 
interpreting the log((my) 2 + 1) (rounded up) new unary predicates to encode 
these colours, we obtain the desired expansion 21'. □ 

Definition 5. Let 21 be a structure and Z an integer. We say that 21 is Z- 
differentiated if, for every 1-type 7r, the number u of elements in A having 
1-type 7r satisfies either u < 1 or u > Z . 

Lemma 3. Let E be a counting signature, let 21 be a structure interpreting E, 
and let Z be a positive integer. Let E' be the signature formed by adding log Z 
(rounded up) new unary predicates to E ; and regard E' as a counting signature 
by taking the counting predicates of E' to be the same as the counting predicates 
o/E. Then 21 can be expanded to a Z -differentiated structure 21' interpreting 
E'. Moreover, if% is chromatic over E, we can ensure that'll' is chromatic over 
E'. 

Proof. For each 1-type tt realized more than once but no more than Z times, 
colour the elements having 1-type tt using Z different colours. By interpreting 
the log Z (rounded up) new unary predicates to encode these colours, we obtain 
the desired expansion 21'. This process clearly preserves chromaticity. □ 

Definition 6. Let E be a counting signature, let 21 be a structure interpreting 
E, and let 7r, n' be 1-types over E. We say that ir and tt' form a noisy pair in 
21 if there do not exist distinct a, a' 6 A such that tp^fa] = ir, tp a [a'] = 7r', and 
tp a [a, a'] is silent. 

Thus, if 7r and 7r' form a noisy pair, every element with 1-type tt either sends 
a message to, or receives a message from, every element with 1-type tt'. Note 
that Definition does not require tt and tt' to be distinct. 
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Lemma 4. Let Y, be a counting signature with exactly m counting predicates, 
let Y be a non-negative integer, let 21 be a Y -branching structure interpreting 
£, and let tt,tt' be 1-types over E. If it and ir' form a noisy pair in 21, then 
either there are at most (mY + l) 2 elements having 1-type ir, or there are at 
most (mY + l) 2 elements having 1-type it' . 

Proof. Suppose for contradiction that 7r and tt' form a noisy pair, but that there 
are more than (mY + l) 2 elements having 1-type tt and more than (mY + l) 2 
elements having 1-type tt'. Now let B be a set of elements having 1-type tt, 
with \B\ > (mY) 2 + mY; and let B' be a set of elements having 1-type tt', 
disjoint from B, with \B'\ = mY + 1. Such sets can evidently be found even 
if tt = tt'. Select any b £ B. Since b sends a message to at most mY elements 
in B' , there exists b' £ B such that b sends no message to b' . Since tt and tt' 
form a noisy pair, we have that, for all b £ B, there exists b' £ B' such that 
b' sends a message to b. But the total number of elements 6 £ A such that b' 
sends a message to b for some b' £ B' is bounded by mY\B'\ = mY(mY + 1), 
contradicting the supposition that \B\ > (mY) 2 + mY. □ 

3 Profiles and approximations 

Let £ be a finite counting signature with counting predicates fx, ... , f m . Enu- 
merate the invertible message-types over £ (in some arbitrary order) as 
jUi,.. . ,^Af*, and the non- invertible message- types over E (again in some ar- 
bitrary order) as /j,m*+i, ■ ■ ■ ,/J>M- (Thus, fix, . . . is an enumeration of all 
the message- types over £.) Denote the set of silent 2- types over S by H. We 
fix the symbols E, m, f h (1 < h < m), A/*, M, [i 3 (1 < j < M) and 5 to have 
these meanings throughout this section. 

Wc introduce the concepts of the Yl-profile and the II- count of an element in 
a structure interpreting E, where II is a set of 1-types. 

Notation 2. Let 21 be a structure interpreting E, and let tt be any 1-type and 
II any set of 1-types over E. When 21 is clear from context, denote by A n the 
set {a £ yl|tp a [a] = 7r}, and denote by An the set 1^^!^ £ II}. In addition, 
denote by II C the set of all and only those 1-types over E not contained in II. 

Remark 2. For any 21 interpreting E and any set of 1-types II over E, A^ = 
A\A n . 

For the next definition, recall that fJ,x, ■ ■ ■ f-M arc the message- types (invert- 
ible and non- invertible) over E. 

Definition 7. Let 21 be a finitely branching structure interpreting E, let a £ A, 
and let II be any set of 1-types over E. The H-profile of a in 21, denoted pr^[a], 
is the M-element integer vector whose jth element (1 < j < M) is given by: 

\{b £ An : b ^ a and tp a [a, b] = f-ij}\. 

If LT is the set of all 1-types over E, we call pi'n[i] simply the profile of a in 21, 
and denote it pr a [a] . 
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Think of the vector pr a [a] as a description of the 'local environment' of a. 
The intuition is that a's local environment tells us, for each message-type /i, 
how many elements a is related to by ^; but it tells us nothing about how those 
elements are related to each other, or about the elements of A to which a does 
not send a message. More generally, pr a [a] is a partial description of that local 
environment — one that ignores elements whose 1-type is not in the set II. 

For the next definition, recall that fx, ... , f m are the counting predicates of 

S. 

Definition 8. Let 21 be a finitely branching structure interpreting E, let a e A, 
and let II be any set of 1-types over E. The TL- count of a in 21, denoted ct a [a], 
is the m-element integer vector whose hth element (1 < h < m) is given by: 

\{beA n :b^a and 21 \= fh[a,b}}\. 

Furthermore, a set B C A is called a H-group if every element of B has the 
same 1-type and every element of B has the same II-count. If II is the set of all 
1-types over E, we call ct a [a] simply the count of a in 21, and denote it ct a [a]. 

Think of the vector ct a [a] as providing a 'statistical summary' of the vector 
pr§[a]. In particular, pr a [a] determines ct a [a], but not conversely. 

Definition 9. Let 21 be a finitely branching, chromatic structure interpreting 
E, let II be a set of 1-types over E, and let B be a subset of A. A structure 
21' over the domain A is a (II, B)- approximation to 21 if (i) 21' is chromatic; (ii) 
every 2-type realized in 21' is also realized in 21; and (iii) for all a £ A: 

1. tp a '[a] = tp a [a]; 

2. prgl[a] = pr^ c [a]. 

3. a 6 A \ B implies pr a [a] = pr a [a]; 

4. a 6 B implies ct a [a] = ct a [a]. 

In Condition 0] of the above definition, the restriction that a e B is in fact 
logically redundant; since if a ^ B, Condition^certainly entails ct a [a] = ct a [et]. 

Remark 3. If 21' is a (II, ^-approximation to 21, II C II', and B C B' , then 
21' is also a (II', i3')-approximation to 21. If 21' is a (II, ^-approximation to 21, 
and 21" is a (II, i?)-approximation to 21', then 21" is a (II, ^-approximation to 
21. 

Our strategy now is to show that, in favourable circumstances, (II, B)- 
approximations can be obtained in which the elements of B exhibit 'few' profiles. 
We first consider the special case where II is a singleton. For the next definition, 
recall that /Lti, . . . , /Xm* are the invertible message-types over E; thus, the first 
M* coordinates of any 7r-profile pr a w j [a] tell us, for each invertible message-type 
ji, how many elements with 1-type 7r a sends a message of type /i to. 



3 PROFILES AND APPROXIMATIONS 



7 



Definition 10. Let tt be a 1-type over S, let 21 be a finitely branching structure 
interpreting S, and let B be a subset of A. We say that S is a it- patch if i? is a 
{7r}-group and, for all a, b G B. the vectors pr9Ja] and pr? ,[&] agree in each 
of their first M* coordinates. 

Lemma 5. Let 21 be a chromatic, Y -branching structure interpreting S, let tt 
be a 1-type over S, and let B C A be a tt -patch. If% is (mY +1) 2 -differentiated, 
then there exists a structure 21' such that 21' is a ({tt}, B)- approximation to 21 
in which the elements of B all have the same {^{-profile. 

Proof. Let tt* be the 1-type such that B C A\* , and suppose that 21 is (mY+1) 2 - 
diffcrcntiatcd. If \B\ < 1, 21' — 21 obviously satisfies the conditions of the 
Lemma. And if \Ak\ < 1, any two elements with the same {7r}-count have the 
same {7r}-profile, so that 21' = 21 again satisfies the conditions of the Lemma. 
So we may suppose that B and A^ both contain more than one clement, whence 
|Ar* I > (mY +l) 2 and \A^\ > (mY + l) 2 . By Lemma|H let r be a silent 2-type 
such that, for some a 6 A^* and some a' £ ^4^, tp a [a, a'] = r. 

For a G B, let 

■^o = { a ' G | a ^ a' and tp a [a, a'] is a non-invertible message- type}; 

and for a ^ B, let ^4 a = 0. Notice, incidentally, that a' G A a implies a ^ ^4 a /. 
Choose 6 G B for which is smallest, and fix b. Enumerate Af, (finite or 
infinite) as bi,bi, ■ ■ ■ ■ For any a G B not equal to b, let A a be a subset of A a 
having the same number of elements as A^, and enumerate A a as a\, a^, ■ ■ ■ ■ 

We now define the structure 21' by assigning 2- types as follows. For all a G B 
such that a^b, set 

tp a '[o,o i ]=tp st [6,6 i ] I (1) 
where z ranges over the enumeration of A a , and set 

tp a '[a,a'] = r, (2) 

where a! is any element of A a \ A a . In addition, for all distinct a, a' such that 
a' G" ^4 a and a g" j4 a ' , set 

tp a '[o,o'] =tp 2l [a,a / ]. (3) 

Since a' G A a implies a A a /, none of these assignments overwrites any other. 
And since B C A^. , the 1-type assignments implicit in never clash: 

indeed, we have tp a [a] = tp a [a] for all a G A. Furthermore, the transformation 
from 21 to 21' does not affect invertiblc message- types. That is: for distinct a, a! , 
tp a [a, a'] is an invertiblc message-type if and only if tp 21 [a, a'] is an invertible 
message- type; and moreover, if tp a [a, a'] is an invertible message- type, then 
tp a '[a,a'] =tp 2l [a,a']. 

We now verify that 21' is a ({tt}, ^-approximation to 21. From the remarks 
of the previous paragraph and the fact that 21 is chromatic, we have that 21' 
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is also chromatic. In addition, it is immediate from JTJ J3J) that every 2-type 
realized in 21' is also realized in 21. Now let a be any element of B. Since 
B is a 7r-patch, the vectors wf^\ [°] an d P 1 ^} [&] by definition agree in their 
first M* coordinates (corresponding to the invertible message- types). Hence, 
since we have just shown that the transformation from 21 to 21' does not affect 
invertible message- types, the vectors pr ^ j [a] and P r |^} [b] also agree in their 
first M* coordinates. Furthermore, the assignments JIJ and guarantee that 
pr?, [a] and pr? -.[b] also agree in the remaining coordinates M* + 1, . . . ,M 
(corresponding to the non- invertible message-types). Hence we have: 

prf; } [a]=prf T} [6] for all a E B. (4) 

It is now a simple matter to check the numbered conditions in Dcfinition|ni(iii). 
Let a £ A. 

1. We have already established that tp 21 [a] = tp a [a]. 

2. Let a' be any element of As n \c with a ^ a'. Then certainly a' $ A a C A ff , 
so that tp 21 [a, a'] can be different from tp 21 [a, a'] only if a £ A a > . But 
if a £ A a i, then neither tp a [a, a'] nor tp 21 [a, a'] can be a message-type. 
Hence, pr|£ }0 [a] = pr|^ }e [a]. 

3. Suppose a £ A \ B, and let a' be any element of A with a ^ a'. The 
argument now proceeds much as for the previous condition: certainly, 
a' g" A a = 0, so tp a [a, a'] can be different from tp a [a, a'] only if a £ A a r. 
But if a £ yl a ' , then neither tp 21 [a, a'] nor tp 21 [a, a'] can be a message-type. 
Hence pr 21 [a] = pr a [a] . 

4. If a £ B, Equation Q yields ct?\[a] = ct? J6]. And since B is a {7r}- 
group, ct? w} [6] = ct? ff} [a]. It follows that ct^^a] = ct^fo]. 

Finally, it is immediate from Equation Q that all elements of B have the same 
{7r}-profile in 21'. □ 

Thus, Lemma 03 assures us that, as long as we are content to work with 
({■7r}, _B)-approximations to highly differentiated structures, we may unify the 
{7r}-profiles of the elements in any 7r-patch B. 

We next deal with the case where n is not a singleton. Here, the possibilities 
for reducing the number of n-profiles realized in a n-group are more limited. 

Lemma 6. Let % be a finitely branching structure interpreting E, let H be a set 
of 1-types over E, let B C A be a H-group, and let m be a permutation of B. 
Then there exists a structure 21' interpreting E over the domain A such that 21' 
is a (n, B)- approximation to 21, and for all b £ B, pr a = pr^[b]. 

Proof. First, extend u> to the whole of A by setting u>(a) = a for a € A \ B. 
Next, for all b £ A, define: 



Wbn(a) 



u(a) if b £ An 
a otherwise; 



3 PROFILES AND APPROXIMATIONS 



9 



Thus, uibu is a permutation of A (which may be the identity). Since (wmi) -1 and 
(cj _1 )hn are the same permutation, we may unambiguously write u)^ n . Clearly, 
oj fixes B setwise and A\B pointwisc; so, therefore, does w^rf. Moreover, since 
B is a Il-group, either B C An or B C An? = A \ A n . Hence, v fixes both A u 
and Aw setwise, and so therefore does w^J. 

Define the structure 21' over domain A by setting, for all distinct a, a' G A: 

tp*'M=tp*[ U J n (a),w£(a% (5) 

To show that 21' is well-defined, we must show first that the elements w~'n(°0 
and w~]-r(a') in each instance of JSJ are distinct, and second, that the 1-type 
assignments implicit in the different instances of JSJ do not clash. Suppose, 
then that a ^ o'; we prove that uj~, u (a) ^ to~^{a'). Since the permutations w~ n 
and fix B setwise and A \ B pointwise, we may assume that a, a' G B. If 
B C An, then w~, n (a) = uj~ 1 (a) and ui~^(a') = uj~ 1 (a'); if, on the other hand, 
B C A\A 7T , then cj~, n (a) = a and w~^(a') = a'. Either way, w~/ n (a) w an( a ')- 
Next, we prove that the 1-type assignments in J5j) never clash. Since all elements 
of B have the same 1-type, and since lo is the identity outside B, we have, for all 
a, a', tp a [w~, n (a)] = tp a [a]; thus, tp a [w~, n (a)] does not depend on a'. Hence, 
the 1-type assignments implicit in (0) cannot clash, and 21' is indeed well-defined. 
In fact, this argument establishes that tp a [a] = tp a [a] for all a G A. 

We first check the numbered conditions of Definition (iii) in turn. Let a G A. 

1. We have just established that tp a [a] = tp a [a]. 

2. For all b G An", uj^n( a ) = a; in particular, if a G An<=, then u)~^(a) = a. 
Therefore, w~ n is in fact a permutation of An<= \ {a}, and moreover, for 
all b G A n c \ {a}, tp a '[a,6] = tp a [a, w~£(b)]. Thus, the list of 2-types 
tp a [a, b] obtained as b ranges over An? \ {a} is (in some order) the list 
of 2-types tp a [a, b 1 ] obtained as 6' ranges over An" \ {a}. It follows that 
prgl[a] = prg [a]. 

3. If a G A \ B, then, for all b G A, w^(a) = a; in particular, uj^(a) = a. 
Therefore, uj~^ is a permutation of j4\{a}, and moreover, for all b G A\{a}, 
tp a [a, b] = tp a [a,cj~n(fo)]. Thus, the list of 2-types tp a [a, b] obtained as b 
ranges over A\ {a} is (in some order) the list of 2-types tp a [a, b'] obtained 
as b 1 ranges over A \ {a}. It follows that pr a [a] = pr a [a]. 

4. For all b G An, ^^(a) = uj^ 1 (a); in particular, if a G An, then uj^(o) = 
w _1 (a). Therefore, u>~^ is a bijection from the set An \ {a} to the 
set An \ {u^ 1 ^)}, and moreover, for all b G An \ {a}, tp a [a, 6] = 
tp a [w _1 (a), ^"^(6)]. Thus, the list of 2-types tp a [a, b] obtained as b 
ranges over An \ {a} is (in some order) the list of 2-types tp a [w _1 (a), &'] 
obtained as b' ranges over An \ {a; -1 (a)}. It follows that 



pr a '[a]=prg[a;- 1 (a)]. 



(0) 
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Certainly, then, we have ct§ [a] = ct§[w 1 ( a )]- But since B is a II-group 
in 21, ctg^-^o)] = ctg[o]. Hence ctf}'[a] = ctg[a]. 

We have thus established that, for all a £ A, pr^} [a] = pr^[w _1 (a)] and 
pr^ c [a] = prj^ c [a] . Since 21 is chromatic, it follows easily that 21' is chromatic 
and that all 2-typcs realized in 21' arc realized in 21. Hence, 21' is a (n, B)- 
approximation to 21. Finally, it follows by putting a — u>(b) in Equation (jBJ 
that, for all b £ B, prg' [u(b)] = prg [b] . □ 

Let 21, n and B satisfy the conditions of LemmaEl That lemma then assures 
us that, as long as we are content to work with (n, ^-approximations, we may 
permute the n-profiles of the elements in any n-group B at will! To see the 
power of this idea, let 21 be a structure, let n' and n" be disjoint sets of 1-types, 
and let B be a subset of A whose elements realize J' different n'-profiles and J" 
different n"-profilcs. If II = II' U II", how many Il-profilcs arc realized by the 
elements of B? Answer: in general J' J"; however, the following lemma shows 
that, under certain circumstances, we can find a (II, _B)-approximation to 21 for 
which the answer is smaller. 

Lemma 7. 21 be a finitely branching structure interpreting S, let II', II" be 
disjoint, nonempty sets of 1-types, and let II = II' U II". Suppose B C A is 
both a II' -group and a II" -group, and hence also a H-group. Let the number 
°f different II' -profiles realized in 21 by the elements of B be J' ; and let the 
number of different II" -profiles realized in 21 by the elements of B be J" . Then 
there exists a (II, B)- approximation 21" to 21 in which at most J' + J" different 
H-profiles are realized by the elements of B. 

Proof. For perspicuity, we give the proof where B is finite; the modifications 
required for the general case arc easy. Enumerate B as b\, . . . 6j. Let the various 
Il'-profilcs realized by at least one element of B be u( , . . . , v'j, ; and let the various 
n"-profiles realized by at least one element of B be v", . .., v'j,,. Since B is a 
n'-group, Lemma guarantees that we can obtain a (II', _B)-approximation to 
21 in which the Il'-profilcs of B are permuted at will. So let 21' be a (II', B)- 
approximation to 21 in which the Il'-profiles of the b\ , . . . bi fall into consecutive 
blocks in the sense depicted in the middle column in Fig. ^ More precisely, we 
have integers = I\ < 1% < ■ ■ ■ < Ij'+i = I such that, for all j (1 < j < J'), 
pr a [bi] = v'j for i in the range [Ij + l,Ij + i]. Since B is a also II"-group, we can 
likewise obtain a structure 21" such that 21" is a (II", £?)-approximation to 21' in 
which the elements of B have II"-profilcs likewise arranged in consecutive blocks. 
Since 21" is a (II", £?)-approximation to 21' and the sets II' and II" are disjoint, 
the n'-profiles of the elements of B will be unaffected by the transformation 
from 21' to 21": a typical alignment of Il'-profilcs and IT'-profiles in 21" is shown 
in Fig. ^ By inspection, at most J'+ J" II-profiles are realized in 21" by the 
elements of B. From Remark |3 21" is a (II, ^-approximation to 21, because 21' 
is a (LI', B)-approximation to 21 and 21" is a (II", ^-approximation to 21'. □ 

This leads us to the main result of this section. 
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of B 


n'-profilc in 21' 
(and also in 21") 


iT-profile in 21" 


h 


v'l 


v" 




v' 2 






v'J„ 





Figure 1: Arrangement of Il'-profiles and n"-profilcs in B. 

Lemma 8. Let$l be a Y -branching, [rnY + l) 2 -differentiated, chromatic struc- 
ture over E ; let I > 0, let IT be a non-empty set of 1-types such that IT| < 2 l , 
and let B C A be a TL-group. Then there is a structure 21' such that 21' is a 
(II, B)- approximation to 21 and the number of different TL-profiles realized in 21' 
by the elements of B is at most 2 l (M* + l)(Y + \) lm . 

Proof. By induction on /. To aid readability, let Ki stand for 2 l (M* + l)(Y + 
l) lrn . If I = 0, let n = {tt}. Decompose B into maximal 7r-patches B\, . . . Bh- 
Since 21 is chromatic, the first M* coordinates of wf^} H are either all zero, or 
else are all zero except for a single occurrence of unity. Therefore, H < AT* + 1. 
Now let 2to = 21, and for all h (1 < h < H), apply Lemma [3] to obtain a 
structure 21^ such that 21^ is a ({tt}, -B/^-approximation to 2l^_i in which the 
elements of Bh all have the same {7r}-profile. By Remark|Sl 2l# is a ({ir},B)- 
approximation to 21. And because the Bh are pairwise disjoint, 1 < h < h' < H 
implies pr a, »' [a] = pr a ' l [a] for all a 6 Bh- Hence, the total number of {tt}- 
profiles realized by elements of B in 2l# is at most H < M* + 1 = Kq. Thus, 
setting 21' = 21^ establishes the case / = 0. 

Now suppose I > 0. We may assume IT is not a singleton, since otherwise, we can 
employ the argument of the case I — 0; so let IT be partitioned into non-empty 
sets IT' and IT" each of cardinality at most 2 l ~ 1 . Also, partition B into maximal 
IT'-groups Bi, . . . , Bh (say). Since B is a ITgroup and IT' n II" = 0, the B\, 
. . . , Bh will also be IT"-groups. Moreover, since 21 is y-branching, the IT-count 
of any element in 21 is one of at most (Y + l) m different vectors; and since B is 
a IT-group, all elements of B must have the same 1-type, whence H < (Y + l) rn . 
Again, let 2lo = 21, and consider the set B\. By inductive hypothesis, let 21^ be 
a (IT', _Bi)-approximation to 2lo in which at most K\-\ IT'-profiles are realized 
by the elements of B\. Again, by inductive hypothesis, let 21" be a (IT",i?i)- 
approximation to 21^ in which at most Il"-profiles are realized by the 
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elements of B\. Thus, in the structure 21", B\ is a Il'-group realizing at most 
Ki-i different Il'-profiles, and also a II"-group realizing at most Ki-\ different 
II"-profiles. By Lemma[71 let 2ti be a (II, B\ )-approximation to 21" in which the 
elements of B\ realize at most 2iQ_i different II-profiles. By Remark El 21 1 is 
a (n, B\ )-approximation to 2to- Treating the sets i?2, • • • , Bh in the same way, 
we obtain structures 2lh {1 < h < H) such that, for each /i in this range, 21?,, is 
a (II, _B)-approximation to 2U-1 in which at most 2if;_i different n-profiles are 
realized by the elements of Bh- By Remark EJ 21 # is a (II, B)-approximation 
to 21. And because the Bh are pairwise disjoint, 1 < h < h' < H implies that 
pr 2 ^' [a] = pi Mh [a] for all a 6 Bh. Hence, the total number of II-profiles realized 
by elements of B in 2l# is at most 

2HK l _ 1 < 2{Y + l) m K l _ 1 
= Kl 

Thus, setting 21' = 2l# completes the induction. □ 

4 Deciding finite satisfiability 

Let E be a signature of unary and binary predicates, let a(x) be a quantifier- 
free, equality-free formula over E with x as its only variable, let (3{x, y) be a 
quantifier-free, equality-free formula over £ with x and y as its only variables, 
let m be a positive integer, let fx, ... , f m be binary predicates, let C\, . . . , C' rn 
be positive integers, let (f>* be the formula 

4>* :=VxaAVxVy(/3Vxtty) A /\ \Jx3 =Ch y{f h {x, y) A x 96 y), (7) 

l</i<m 

and let C = max 1 <^<„ 1 C^. Make E into a counting signature by declaring the 
counting predicates of E to be exactly {fx, . . . , f m }. Let the number of symbols 
(unary and binary predicates) in E be s, so that the total number of 1-types 
over E is L = 2 s . As in Sectional enumerate the invertible message- types 
over E (in some arbitrary order) as . . . ,(J-m*, enumerate the non-invertible 
message-types over E (again in some arbitrary order) as /Um*+1j ■ • ■ ,Mm, and 
denote the set of silent 2-types over E by S. We fix the symbols a, (3, m, fh 
(l<h< m), C h (1 < h < m), C, <f>*, E, s, L, M* , M, fij (1 < j < M) and S 
to have these meanings throughout this section. 

The motivation for introducing (II, ^-approximations in Section |3 is that 
they are good enough for checking the (finite) satisfiability of <j>* . 

Lemma 9. Let 21 be a finitely branching, chromatic structure interpreting E, 
let II be a set of 1-types over E, let B be a subset of A, and let 21' be a (II, B)- 
approximation to 21. 7/21 |= <jf , then 21' \= <f>* . 

Proof. By Remark |3 we may assume without loss of generality that II is the 
set of all 1-types and B = A. Since tp a [a] = tp a [a] for every a € A, 21' |= Vxa. 
Since every 2-type realized in 21' is also realized in 21, 21' |= \/xiy(l3\/ x y). And 
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since ct a '[a] = ct a [a] for every a € A, %' \= /\ 1 < h < m ^x3 =Ch y(fh(x 1 y) Ax ^ 

y). □ 

The next definition relics on conventions established in Notation [H 

Definition 11. A star-type (over E) is a pair er = (71", u), where 7r is a 1-type 
over E and v = (v±, . . . vm) is a vector over N satisfying the condition that, for 
a ll j (1 < j < > implies tp 1 (/x :) ) = 7T. We say that er is chromatic if 

the set of integers 

{vj I 1 < j < M* and tp 2 (/^) = tt'} 

sums to either or 1 for every 1-type tt', and sums to in the case tt' = tt. If 21 
is a finitely branching structure interpreting E and a S A, then (tp a [a], pr a [a]} 
is evidently a star- type, which we denote by st a [a]. We say that the star- type 
er is realized in 21 if er = st a [a] for some a e A 

We note in passing that, if (71", v) is a star-type, and v is not the zero- vector, 
then tt is actually determined by v. 

Notation 3. If er = (tt, v) is a star-type with v = (i>i, . . . Vm), wc denote tt by 
tp(cr) and Vj by er[j] for all j (1 < j < M). 

Remark 4. If 21 is a finitely branching structure interpreting E and a e A, 
then tp(st a [a]) = tp a [a]. Moreover, 21 is chromatic if and only if every star- 
type realized in 21 is chromatic. 

Remark 5. Let er be a chromatic star-type and let j and j' be integers between 
1 and M* (so that fj,j and fj,ji are invertible message-types). If /ij 1 = fjtji, then 
either a[j] — or cr[j'] = 0. In particular, if /.tj 1 = fij, then a[j] = 0. 

Definition 12. Let 21 be a finitely branching structure interpreting E, and let 
X be a positive integer. We say that 21 is X- sparse if 21 realizes no more than 
X different star-types — that is, if |{st a [a] : a € A}\ < X. 

For the next lemma, recall that s is the number of symbols in E, m is the 
number of counting predicates in E, and C = maxi<j,< m Ch- 

Lemma 10. Let Z > (mC + I) 2 , and let X = 4 S (16 S + 1)(C + l) sm . If 
there is a chromatic, Z -differentiated model of <jf interpreting E, then there is a 
chromatic, Z -differentiated, X -sparse model of <jf interpreting E over the same 
domain. 

Proof. Suppose 21 is a chromatic, Z-diffcrentiated model of <j>* interpreting E. 
Since 21 |= (f>*, 21 is C-branching. Note that M* < 2 is . Let II be the set of all 
1-types over E; thus, |II| = 2 s . Let Ai, . . . Ah be a list of the nonempty sets A„, 
where tt £ II; thus, H < 2 s . The sets Ah together partition A; moreover, since 
21 |= (j)*, each Ah (1 < h < H) is a IT-group. Letting 2l = 21, by Lemma 
we can obtain 2ti, . . .21^ such that, for all h (1 < h < H), 21/j is a (IT, Ah)- 
approximation to 2l/,,_i in which at most 2 S (16 S + 1)(C+ l) sm different profiles 
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are realized by the elements of Ah- Using by now familiar reasoning, 21 # is 
therefore a (II, ^-approximation to 21 realizing at most iT.[2 s (16 s + 1)(C + 
l) sm ] < 4 S (16 S + l)(C + l) sm = X different star-types. By Lemma^l 2l ff \= <t>*. 
By Definition 21 # is chromatic; and since tp aH [a] = tp a [a] for all a £ A, 21 h 
is also Z-differentiated. Thus 21' = 2l# is the required structure. □ 

Notation 4. We write X to denote the set of (unordered) pairs of (not necessar- 
ily distinct) integers between 1 and L. Formally: X = {{i,i'} | 1 < i < i' < L}. 

For the next definition, recall that 3 is the set of silent 2-types over E. 

Definition 13. A frame over E is a tuple T = (<x, /, 0), where a = (oi, . . . , crjv) 
is a list of pairwise distinct star- types over E, I is a subset of X, and 9 is a 
function 6 : I — ► H such that, for all {i, i'} £ / with i < i', tp x (0({i, i'})) = 7r.; 
and tp 2 (0({i, i'})) = 7Tj'. The dimension of ct is N. For V a positive integer, 
T is Y-bounded if, for all fc (1 < k < N) and all j (1 < j < M), a k [j] < Y. 
Finally, T is chromatic if every a k is chromatic. 

Think of a frame as a (putative) statistical summary of a structure 21, 
specifically: 

Definition 14. Let T = (a, 1,0) be a frame over E, and let 21 be a structure 
interpreting E. We say that T describes 21 if the following conditions hold: 

1. a is a list of all and only those star- types realized in 21; 

2. / is the set of all and only those {i, i'} (1 < i < i' < L) such that 7Tj and 
7Tj/ do not form a noisy pair in 21 (see Definition EJ) ; 

3. for each {i,i'} G /, there exist a G A n and a' G A w / such that a ^ a! and 
tp a [a,a'] =0({i,i'}). 

Every finitely branching structure 21 is evidently described by some (not 
necessarily unique) frame; and certain interesting properties of 21 correspond to 
obvious properties of the frames which describe it, as we see from the following 
two lemmas. 

Lemma 11. Let % be a finitely branching structure interpreting E and let T be 
a frame over E which describes 21. Then: (i) 21 is chromatic if and only if T is 
chromatic; (ii) 21 is X -sparse if and only if T has dimension at most X; and 
(iii) z/2l is Y -branching then T has bound Y . 

Proof. Immediate. □ 

For the next definition, recall that a 1-type 7r is simply a finite collection of 
formulas, so that /\ir denotes the conjunction of those formulas; similarly for 
2-types. 
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Definition 15. Let T = (a, I, 8) be a frame over E, where er = (o"i, . . . , cat). 
We write T |= 0* if the following conditions are satisfied: 

1. for all k (1 < k < N) \= f\tp(a k ) -> a; 

2. for all jfe (1 < k < N) and all j (1 < j < AJ), if cr k [j] > then 
1= A Mi P(x,v) ^P(y,x); 

3. for all {*, i'} G 7, (= A i'}) -> y) A /%, x); 

4. for all fc (1 < k < N) and all h (1 < /i < to), the integers in the set 
{o-fcbl I 1 < j < M and fh(x, y) £ Mj} sum to Cfe. 

Remark 6. Let .F = (cr, I, 0) be a frame over E, where a = (ci, . . . , oat). If 
^Hf, then, for all k (1 < fc < N), Ei< 3 <m °k[j] < mC. 

Lemma 12. Let 21 be a finitely branching structure interpreting E and let T be 
a frame over E describing 21. J/21 |= 0*, £/ien |= </>*. 

Proof. Almost immediate. □ 

However, while every finitely branching structure is described by some frame, 
not every frame describes a structure; and it is important for us to define a class 
of frames which do. Recall that L is the number of 1-types over E and M* the 
number of invertible message- types over E. 

Notation 5. Let T = (<x, J, 0) be a frame over E, where a = (oi, . . . , ctjv). If 
T is clear from context, for integers i,j, k in the ranges 1 <i < L, 1 < j < M*, 
1 < k < N write: 

J 1 if tp((Tfc) = TTi 

[ otherwise; 

f 1 if, for all j (1 < j < M), tp 2 (^ ) = ir t implies cr fc [j] = 
1 otherwise; 

^2{a k lj} | M* + 1 < j < M and tp 2 ( W ) = 7r. t }; 
^{^'bl I 1 < J < M and tp 2 (/ij) = Tj}. 



Otfc = 

Pik = 
qjk = 

Tik = 
Sik = 



Remark 7. Let T be a frame over E, let 21 be a structure interpreting E, and 
suppose T describes 21. In that case, the symbols o,^, p^, qjk, and Sik in 
Notation have the following interpretations with respect to 21: 

1. Oik = 1 just in case every element with star-type Ok has 1-type TXi) 

2. Pifc = 1 just in case no element with star-type o~k sends a message to any 
element having 1-type 7Tj; 
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3. qjk counts how many messages of type \ij any element having star-type 
<7fc sends; 

4. r,;fc is the total number elements of 1-type ~Ki to which any element having 
star-type <7k sends a non-invertible message; and 

5. Sik is the total number elements of 1-type TTi to which any element having 
star-type o~k sends a message. 

With this notation in hand we can characterize a class of frames whose 
members are guaranteed to describe finite structures. 

Definition 16. Let T = (a, I, 9) be a frame over £, where a = (<7i, . . . , ctjv). 
Let w = (wi, . . . , wn) be a vector of positive integers. Using Notation for all 
i (1 < i < L), all i' (1 < i' < L) and all j (1 < j < M*), let: 

l<k<N l<k<N l<k<N 

Finally, let Z be a positive integer. We say that w is a Z-solution of if the 
following conditions are satisfied for all i (1 < i < L), all i' (1 < z' < L), all j 
(1 < j < M*) and all k (1 < fc < N): 

(CI) = iy , where j' is such that jij 1 = [ij>: 

(C2) s lfc < Ui ; 

(C3) Uj < 1 or m > Z\ 

(C4) if Oik = 1, then cither ui > 1 or r^k < aVi! 

(C5) if {z, z'} /, then either u,i < 1 or uy < 1; 

(C6) if {z, z'} / and Oik = 1, then either Ui > 1 or ?Vfc > Xin. 

Remark 8. Let T be as in Definitional an( l suppose that 21 is a finite structure 
interpreting £ such that 21 is described by T . For all k (1 < k < AT), let u>fc be 
the number of elements of A having star-type in 21. In that case, the symbols 
Ui , and xw in Definition 1161 have the following interpretations with respect 
to 21: 

1. Mi is the number of elements of a € A such that tp a [a] = 7T;; 

2. Vj is the number of pairs (a, b) £ A 2 such that a ^ b and tp a [a, b] = fij; 

3. xu> is the number of elements of a G ^4 such that tp a [a] = 7r.; and a does 
not send a message to any element having 1-type ~ny . 

The following Lemma shows that our definition of ^-correctness is not too 
stringent for the sorts of structures that interest us. 
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Lemma 13. Let % be a finite, Y -branching and Z -differentiated structure in- 
terpreting S, with Z > (mY + l) 2 . Let T = (a, 1,9) be a frame over E. If J 7 
describes 21, then T is has a Z-solution. 

Proof. Let a = (oi, . . . , ctjv), and let Wk = \{a G A : st a [a] = at}\ for all 
fc (1 < k < N). We show that w = (w\, . . . , wn) is a Z-solution of T. In 
doing so, we make free use of Remarks and 03 Note that, by construction, the 
W\, . . . , Wat are all positive. 

CI: If /ij 1 = jUj/, then the sets {(a, 6) | a ^ b and tp a [a,6] = /^} and {(a, 6) | 
a ^ b and tp a [a, 6] = /iy} can obviously be put in 1-1 correspondence, namely: 
(a, 6) i — > (6, a). But the cardinalities of these sets are and iy, respectively. 

C2: Since J 7 describes 21, any element of A having star-type Ofc sends a message 
to exactly Sik elements having 1-type 7Tj. But Uj is the number of elements of 
A having 1-type 7Tj. Since o~k is realized in 21, Sjfe < Uj. 

C3: Immediate given that 21 is Z-differcntiated. 

C4: If Oife = 1 and Uj < 1, then Uj = 1, so that 21 contains exactly one element 
with 1-type m; moreover, this element has star-type <7fc. Denote this element by 
a. Thus, a sends a non-invertible message to exactly rv/c elements with 1-type 
7iv . Clearly, none of these elements sends a message back to a (since otherwise 
a's message to it would be invertible), so that there exist at least r^k elements 
with 1-type 7iy which do not send a message to a. But since a is the only 
element with 1-type iti, there exist at least ry^ elements with 1-type 7iy which 
do not send a message to any element of 1-type 7^. In other words, ry^ < x^n. 

C5: Since T describes 21, {i,i'} $ / implies that the 1-types 7^ and 7Tj' form 
a noisy pair in 21. In that case, by Lemma 0] either Ui < (mY + l) 2 or u,< < 
(mY + l) 2 . Since 21 is Z-differcntiated and (mY + l) 2 < Z, Ui < 1 or m< < 1. 

C6: As already observed, if Ojfc = 1 and Ui < 1, then 21 contains exactly one 
element a having 1-typc 7Tj; moreover, this element has star-type o~k, and Xin is 
the number of elements of 1-type 7iy which do not send a message to a. Since 
T describes 21, {«,«'} 2" I implies that m and mi form a noisy pair, whence a 
sends a message — in fact, a non-invertible message — to all Xyi of these elements. 
But since a has star- type au, a sends a non-invertible message to exactly ry^ 
elements having 1-type 7iy. Thus, ry^ > ayj. □ 

We now prove a converse of Lemma 1131 

Lemma 14. Let T be a chromatic frame over S, and let Z > 3mC — 1. I/.7 7 
/ias a Z-solution and T \= <f>* , then there exists a finite structure 21 interpreting 
S sucfc i/iai 21 1= </>* . 

Proof. Let J 7 = (ct, J, 0), let <r = (o\, . . . , c/v), and let w = (wi, . . . , u>/y) be a 
Z-solution of J 7 . In the sequel, we use the symbols on,, Pik, Qjk, and s^k (with 
indices in the appropriate ranges), as specified in Notation |5| and the symbols 
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Ui, Vj and Xw (again, with indices in the appropriate ranges), as specified in 
Definition El Hence, the conditions C1-C6 of Definition 1161 hold. 

For every k (1 < k < N), let A k be a set of cardinality w k , and let A be the 
disjoint union of the A k . Think of A k as the set of elements which 'want' to 
have star-type cr k . In addition, we define for all i (1 < i < L), all i' (1 < i' < L) 
and all j (1 < j < M*): 

Ui =\J{A k | 1 < k < N and o lk = 1} 
X iV = {J{A k | 1 < k < N and o ?;fem . = 1} 
^ = IJ{ Afc I 1 < ^ < iV and q jk = 1}. 

Since T is chromatic, <j.,- fc < 1 for all j (1 < j < M*) and all k (1 < fc < N). 
Thus, for all i, i' and j in the appropriate ranges: 

\Ui\ = u^, \X iv \ = xw, \Vj\ = Vj. 

Think of Ui as the set of elements which 'want' to have 1-type iTi, Xui as the set 
of elements in Ui which do not 'want' to send a message to any element in Ui>, 
and Vj as the set of elements which 'want' to send an (invertible) message of type 
[ij to some other element. We remark that A k C Ui if and only if tp(ofc) = 7T;. 
Moreover, for all j (1 < j < M*), there exists a unique i (1 < i < L) such that 
V} C J7i — namely, that i such that tp 1 (/x J ) = 7Ti. We now convert the domain 
A into a structure 21 in four steps. 

Step 1 (Interpreting the unary predicates and diagonals of binary predicates): 
For every k (1 < k < N) and every a S A k , set tp^fa] = tp (a k ). At the end of 
this step, we have, for every i (1 < i < L) and every a £ Ui, tp 2l [a] = 7r.;. 

Step 2 (Fixing the invertible message- types): For every j (1 < j < M*), let 
j' be such that /ij 1 = fiji. By CI, Vj and Vji are equinumerous. If j' > j, 
pick some 1-1 correspondence between Vj and V}< ; and for every a S Vj, set 
tp 21 ^, a'] = jij, where a' is the element of Vj> corresponding to a G Vj. This 
completes Step 2. We must show that these assignments are meaningful, do 
not clash with Step 1, and do not clash with each other. Suppose then that 
the assignment tp a [a, a'] = \ij is made, and that fij 1 = fiji. Thus, a e Vj 
and a' G Vj' ■ To show that the assignment is meaningful, we must prove that 
a ^ a! . For contradiction, suppose a = a', and let k be such that a G A k . 
But then a k [j] > and a k [j'] > 0, which is impossible by Remark[3 To show 
that the assignment does not clash with Step 1, suppose fij 1 = /-ij>, and let i, 
i' be such that Vj C Ui and Vj' G Ui>. As observed above, ■n.i = tp 1 (/ij) and 
7Ti> = tp 1 (/x J ') = tp 2 (/ij), which conforms to the assignments in Step 1. To 
show that these assignments do not clash with each other, it suffices to prove 
that, if a G Vj D Vh, a' G Vji D Vh> nj 1 = Hj> and fi^ = fih' , then j = h. 
Suppose then that antecedent of this conditional holds; let k and k' be such 
that a G A k and a' G A k > ■ Then a k i [j'] > and a k / [h'\ > 0. Since a k > is a 
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Ui 



U 





Case 1: m = 1 



Case 3: 



> 3mC, 



> 3mC" 



Figure 2: Dealing with non-invcrtible messages. 



star-type, tp^fXj,) = tp x (/i^), whence tp 2 (^) = tp 2 (/Uh)- But cr fe [j] > and 
<Jk[h] > 0, and since <7k is a chromatic star-type, j = /i. Note that, if /jj 1 = fij, 
then V 3 ; = by Remark [SJ Thus, at the end of Step 2, for every element a £ A 
and and every j (1 < j < M*), a sends a (unique) message of type fij to some 
other element if and only if a £ Vj. That is: for all k (1 < k < N), all a £ Ak, 
and all j (1 < j < M*), there are exactly <Tk[j] elements a' £ A such that 
a 1 and tp a [a, a'] has been set to the (invertible) message- type \Xj. We make 
one further observation before proceeding. Suppose that tp a [a, a'] is assigned 
in this step and that a € Ui; we claim that a' $ Xj,n for any i'. To see this, 
suppose a e A^ C Vj and a' G C Vj,, with //J 1 = . If a e Ui, then 
tp 1 (/ij) = tp 2 (/ij') = 7Tj. But crfc'[j] > 0, whence p^y = 0, whence a' ^ Xj/^. 
This observation will be useful in Step 3. 



Step 3 (Fixing the non- invertible message- types): Let i and i' be such that 
1 < i < i' < L. We fix all the non-invertible messages sent, in cither direction, 
between Ui and U^. By C3, either Ui < 1 or Ui > Z; similarly, either Ui, < 1 
or Ui, > Z. We consider five cases. 

Case 1: Ui = 0. In this case, there are no elements of Ui and hence no 2-type 
assignments to be made between elements of Ui and elements of U^ . Note that, 
by C2, s lk = for all k (1 < k < N), whence a k [j] = for all k (1 < k < N) 



and for all j (1 < j < M) such that tp 2 (/ij) 



(Intuitively, no element 



of A — and in particular of 
1-type TTi anyway.) 



'wants' to send a message to an element with 



Case 2: Ui = 1. Let a be the sole element of Ui, and let k be such that 
a £ Ak . We deal first with the assignment of non-invertible messages sent from 
U^ to Ui = {a}. Consider any a' £ Ay C U^. By C2, < 1; hence there 
is at most one value of j in the range 1 < j < M such that tp 2 (/Xj) = 7Tj 
and o~k'[j] > 0. Suppose then that such a j exists. Again, since Sik' < 1, 
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<J k'[j] = 1- If j < M*, then this message has already been dealt with in Step 
2; otherwise, set tp a [a',a] = Since tp 1 (/ij) = 7IV and tp 2 (/Xj) = 7Tj, this 
assignment docs not clash with Step 1. Observe also that, just as in Step 2, if 
this assignment is made, we have, by definition, p^i = 0, so that a' $ X^i. By 
carrying out the same procedure for all a' G U^, we complete the assignment 
of non-invertible messages sent from [/;< to f/j. It remains to deal with the 
non-invertible messages sent from Ui = {a} to Uy. Remembering that a G A^, 
C4 ensures the existence of a subset R C Xin such that \R\ = ry^. For each 
j (M* + 1 < j < M), if tp 2 (/Xj) = 7Ti', select qjk fresh elements a' of R, and 
make the assignment tp a [a,a'] = fij. (There are enough such elements by the 
definition of Since tp 1 (/ij) = 7r,; and tp 2 (/ij) = 7r»', this assignment does 

not clash with Step 1. Moreover, we have observed that tp a [a, a'] has previously 
been assigned (either in this step or in Step 2) only if a! Xin. Thus, these 
assignments do not clash with those made earlier in this step or those made in 
Step 2. The situation is illustrated in Fig. Case 2. 

Case 3: = and U{ > Z. Symmetrical to Case 1. 

Case 4 : u v = 1 an d u.i > Z. Symmetrical to Case 2. 

Case 5: Ui > Z and iiy > Z. Since Z > 3mC — 1, partition Ui into three 
sets Uio, Un, Ui2, each containing at least mC elements; and similarly for Ui'. 
Suppose a G Ui. Then for some h (0 < h < 3), a G Uih- Let k be such that 
a G Ak, and let h' = h + 1 (mod 3). For all j (M* + 1 < j < M), pick a k [j] 
fresh elements a' of t/j'/j/ such that tp a [a, a'] was not assigned in Step 2, and set 
tp a [a, a'] = fij. By RemarkEl we have, for each k (1 < k < N). Yli<j<M a k[j] < 
mC; and since |J7i'^'| > mC, we never run out of fresh elements to pick. In this 
way, we deal with all messages sent from Ui to ; the messages sent from Uy 
to Ui are dealt with symmetrically. It is obvious that these assignments do not 
clash with Step 1 (or Step 2); and the fact that b! = h + 1 (mod 3), ensures 
that they do not clash with each other (even if i = i'), as is evident from Fig. [21 
Case 5. 

Performing these assignments for all pairs i, i' such that 1 < i < i' < L completes 
Step 3. At the end of Step 3, then, for all k (1 < k < N), all a G Ak, and all 
j (1 < 3 < M), there are exactly <Jk[j] elements a' G A such that a ^ a' and 
tp a [a, a'] = ii j. 

Step 4 (Fixing the silent 2-types): Finally, we deal with pairs of distinct el- 
ements a, a' whose 2-type in 21 has not been yet been assigned. Let be 
such that a G Ui and a' G Ui> , and suppose, without loss of generality, that 
i < i' . We claim that (i, i') G /. For suppose otherwise. By C5, we have either 
Ui = 1 or Ui> = 1. Assume the former. Now let k,k' be such that a G Ak 
and a' G Ak> ■ Thus, ow = 1- If Pik 1 = then there is some j (1 < j < M) 
such that qjk' = Cfe'[j] > and tp 2 (/ij) = 7Ti, whence — bearing in mind that 
a is the unique element of Ui — tp a [a,a'] will certainly have been assigned in 
Step 2 (if [ij is an invertible message-type) or in Step 3 Case 2 (if \ij is a non- 
invertible message- type), contradicting our supposition. So we may assume that 
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Pik> = 1 and hence Oi'k'Pik' = 1- That is: a G -XVj. But = avj. And 

by C6, ajj/j < ry^. Yet in Step 3 (Case 2), ry^ elements of Xyi were chosen 
to receive messages from a. Hence a' must be among these elements, so that 
tp a [a, a'] has already been assigned, again a contradiction. The case where 
Ui' < 1 proceeds symmetrically. Thus, we have established that, if tp a [a,a'] 
has not yet been assigned, then {i, i'} £ /, so that we can make the assignment 
tp a [a, a'] = 6({i, i'}). Since tp 1 (6'({i, i'})) = ■Ki and tp 2 (0({«, i'})) = 7T,', there is 
no clash with Step 1. Evidently, we can proceed in this way until all remaining 
2-types have been assigned. 

This completes the construction of 21. The only 1- types realized in 21 are the 1- 
types tp(<7fc) (where 1 < k < TV). The only message- types realized in 21 are those 
Hj such that <Jk[j] > for some k. And the only silent 2-types realized in 21 are 
the 9({i,i'}) for e I. Since T |= </>*, we have 21 |= Vxa AVxVy(/3 Vi w y). 

Moreover, for all fc (1 < < N), and for all a G Afe, we have st a [a] = cr^ . 
Since J 7 |= 0*, we have, for all a € A, ct a [a] = (Ci, . . . , C m ); in other words, 
21 H /\i< h < m Vx3=c h y{fh{x,y) Ax 96 y). Hence, 21 (= 0*- □ 

We can now employ a standard result to bound the complexity of determin- 
ing whether a given frame has a ^-solution. 

Lemma 15. Let T he a Y -bounded, N -dimensional frame over S, and Zei Z 
be an integer. Then T has a Z-solution if and only if it has a Z-solution w 
such that every component of w is bounded by some (fixed) singly exponential 
function of the quantity L + AI* + N + log Y + log Z . 

Proof. This follows immediately from the well-known result (Papadimitriou 
that, if an integer programming problem has a solution at all, then it has a 
solution all of whose components are bounded by a singly exponential function of 
the size of the problem (encoded in the obvious way). But the conditions C1-C6 
in Dcfinition lltjl amount to a disjunction of integer programming problems whose 
sizes are all bounded by a polynomial function ofL + M* + A + logy-|- log Z . □ 

Theorem 1. The problem Fin-Sat-C 2 is m NEXPTIME. 

Proof. Given <fi, let to, f±, . . . , f m , C\, . . . , C m and </>* be as in Lemma J] and 
let C = maxi< ft < m C h ; thus C < 2^*H. Let Z = max(3mC - 1, (mC + l) 2 ). 
Let E be the signature of <f>* together with log((mC) 2 + 1) + \ogZ (rounded 
up) additional unary predicates, regarded as a counting signature by taking 
the counting predicates of E to be fx, ... , f m . Denote the total size of E by 
s: evidently, s is bounded by a polynomial function of \\4>*\\. Finally, let X — 
4 s . (16 s + 1)(C+ l) sm . 

Since C < 2^ C is bounded by a singly exponential function of \\<j)\\. Thus, 
the problem of determining the satisfiability of <fi over structures of size C or 
less is obviously in NEXPTIME. By Lemmanthcn, it suffices to show that the 
satisfiability of (jf can be decided nondeterministically in time bounded by a 
singly exponential function of \\<t>*\\. 
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We claim that <fi* is finitely satisfiable if and only if there exists a chromatic, 
C-bounded frame T over £ of dimension N < X, such that T has a Z-solution 
and T |= <j>* . For suppose <fi* is finitely satisfiable. By Lemmas [21 and |3 <f>* 
has a finite, chromatic. Z-diffcrcntiated model 21 interpreting S. By Lemma ITU1 
we may assume without loss of generality that 21 is also X-sparse. Let J- he a. 
frame over E describing 21. By Lcmma^J T is chromatic, of dimension N < X 
and C-bounded. By Lemma IT2l T |= <j>* . Since Z > (mC + l) 2 , Lemma 113*1 
guarantees that J- has a Z-solution. Conversely, suppose J- is a chromatic frame 
over £ such that J- has a Z-solution and J- \= 4>* . Since Z > 3mC— 1, Lemmal*""?"! 
guarantees that 0* is finitely satisfiable. 

By Lemma El then, <f>* is satisfiable if and only if there exists a chromatic, C- 
bounded frame T over £ of dimension N < X and a vector w of positive integers 
bounded by some doubly exponential function of \\(f>* ||, such that T |= 0* and w 
is a Z-solution of JF. Using the standard binary encoding of integers, it is easy 
to write down J- and w in a number of bits bounded by a singly exponential 
function of \\<j>*\\, and to check whether they satisfy the above conditions in time 
bounded by a singly exponential function of \\4>*\\- This completes the proof. □ 

We note in passing that the above proof yields a small model property for 
finitely satisfiable C 2 -formulas. 

Corollary 1. Let <\> be a formula of C 2 . If <f> is finitely satisfiable, then it is 
satisfiable in a structure of size bounded by a doubly exponential function of\\<p\\. 

Proof. The structure built in Lemma fHI from T and its Z-solution w has domain 
of cardinality wi + ■ ■ ■ + wn . □ 

Notice that the complexity result of Theorem is better than one might 
naively expect on the basis of the small model property of Corollary ^ Never- 
theless, the bound of Corollary^is optimal in the sense that there is a sequence 
{4>i} of finitely satisfiable formulas of C 2 whose size grows as a polynomial func- 
tion of i, but whose smallest satisfying structures grow as a doubly exponential 
function of i (Gradel and Otto [2], p. 317). 

5 Deciding satisfiability 

Having established the complexity of determining finite satisfiability in C 2 , we 
now turn to the corresponding (general) satisfiability problem. In fact, there is 
almost no further work to do. 

Notation 6. Let N* denote the set NU{H }. We extend the ordering > and the 
arithmetic operations + and • from N to N* in the obvious way. Specifically, we 
define H > n for all n g N; we define H + Ko = *V^o = and 0-H = H o -0 = 0; 
we define n + Ko = Ho + n = Ho for all n £ N; and we define n ■ Ko = Ho • n = Ho 
for all n £ N such that n > 0. Under this extension, > remains a total order, 
and +, • remain associative and commutative. 



5 DECIDING SATISFIABILITY 



23 



A system of linear equalities and inequalities denning an integer program- 
ming problem can of course be re-interpreted so that solutions are sought not 
over N but over N*. (We always assume that the coefficients occurring in such 
problems are in N.) As an example, the single inequality x\ > x\ + 1 has no 
solutions over N, but it does have a solution over N*, namely, x\ = Ho. 

Lemmas l2T ll(JI apply to both finite and infinite structures. Furthermore, the 
definition of a frame and its relationship to the (finitely branching) structures 
it describes make no reference to the cardinalities of those structures, and Lem- 
mas ^2 El again apply generally. The concept of a Z-solution introduced in 
Definition 1 1 61 requires extension, however. 

Definition 17. Let £ and T = {a, I, 8) be as in Definition 1161 and let Z be a 
positive integer. Let w — (wi, . . . ,wn) be a vector of non-zero elements of N*. 
We say that w is an extended Z-solution of T if w satisfies the conditions of 
Definition^] with the arithmetic interpreted over N* as specified in Notation® 

We must check that the obvious analogues of Lemmas 1131 and PHI hold: 

Lemma 16. Let % be a Y -branching and Z -differentiated structure interpreting 
£, with Z > (mY + l) 2 . Let T = (a, I, 6) be a frame over S. If T describes 21, 
then T has an extended Z-solution. 

Lemma 17. Let T be a chromatic frame over £, and let Z > 3mC— 1. If T has 
an extended Z-solution and T \= <jf , then there exists a structure 21 interpreting 
S such that 21 1= <fi* . 

The proofs are exactly the same as in the finite case. Note that the variables 
Uj, Vj and xw as well as the Wk may now take the value Ho; by contrast, the 
coefficients Oik, Pik, Qjk, rik and remain finite. Remark |H1 generalizes unprob- 
lematically to countably infinite structures, so that the quantities Uj, Vj and xu> 
mentioned in Definition 1161 continue to have their familiar interpretations. The 
proofs of Lemmas 1161 and IT7I then proceed exactly as before. 

There is one final hurdle to overcome. The proof of Lemma ll5l used a well- 
known result bounding solutions of integer programming problems. Since we 
arc dealing with N* -programming problems, we need the following extension of 
that result. 

Lemma 18. Let $ be a finite set of linear inequalities of the form 

a + aiXi + • • ■ + a n X n < b + bixi + • • ■ + b n x n 

in variables x±, . . . ,x n . Here, all coefficients are assumed to be in N. We take 
the size of <!>, denoted ||$||, to be measured in the usual way, assuming binary 
encoding of integers. If $ has a solution over N* , then $ has a solution over 
N* such that all finite values are bounded by some (fixed) singly exponential 
function of ||$||. 
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Proof. Suppose that <& has a solution over N* . Re-order the variables if necessary 
so that this solution has the form aK , with a = ai, . . . , a*. G N fc for some k 
(0 < k < n) and R a ( n — fc)-tuple of H s. Say that an inequality in <f> does 
not involve the variable ai^ if the corresponding coefficients Oj and 6^ are both 
zero. Let be the set of inequalities in $ involving none of the Xk+i, ■ ■ ■ , £„. 
Thus, ^ , considered as a problem in variables x\, . . . ,Xk, has a solution a over 
N, whence it has has a solution a! bounded by some singly exponential function 
of 1 1 v^H (and hence of ||$||). But then it is easy to see that a' No is a solution of 
$. □ 

Theorem 2. The problem Sat-C 2 is in NEXPTIME. 

Proof. Exactly as for Theorem ^ noting that, by Lemma ITH1 the existence of 
extended Z-solutions for a frame T can be checked nondeterministically in time 
bounded by an exponential function of ||</>*||. □ 

Obviously, there is no interesting small model property for satisfiable C 2 - 
formulas corresponding to Corollarj^ however, we have the next best thing. 

Corollary 2. Let <f> be a formula of C 2 . Then there exist integers X and W , 
with X bounded by a singly exponential function of and W by a doubly 
exponential function of \\4>\\ such that, if '(f) is satisfiable, then it is satisfiable in 
an X-sparse structure in which every star-type is realized either infinitely often 
or at most W times. 



References 

[1] Erich Gradel and Martin Otto. On logics with two variables. Theoretical 
Computer Science, 224(l-2):73-113, 1999. 

[2] Erich Gradel, Martin Otto, and Eric Rosen. Two-variable logic with count- 
ing is decidable. In Proceedings of the 12th IEEE Symposium on Logic in 
Computer Science, pages 306-317. IEEE Online Publications, 1997. 

[3] Martin Otto. Two-variable first-order logic over ordered domains. Journal 
of Symbolic Logic, 66(2):685-702, 2001. 

[4] Leszek Pacholski, Wieslaw Szwast, and Lidia Tendera. Complexity of two- 
variable logic with counting. In Proceedings of the 12th IEEE Symposium 
on Logic in Computer Science, pages 318-327. IEEE Online Publications, 
1997. 

[5] Leszek Pacholski, Wieslaw Szwast, and Lidia Tendera. Complexity results 
for first-order two-variable logic with counting. SIAM Journal on Comput- 
ing, 29(4):1083-1117, 1999. 

[6] Christos H. Papadimitriou. On the complexity of integer programming. 
Journal of the Association for Computing Machinery, 28(4):765-768, 1981. 



